Applying pitch connection control in Mandarin speech synthesis

Authors

  • Yi Zhou
  • Yiqing Zu
  • Zhenli Yu
  • Dongjian Yue
  • Guilin Chen
Abstract

This paper describes a novel tone-based pitch connection control for unit selection that improves the naturalness of the output speech of a baseline Mandarin text-to-speech (TTS) system. The study focuses mainly on pitch connections between concatenated syllables. To improve concatenation quality, we use the offset pitch of the preceding syllable and the onset pitch of the following syllable in unit selection. Based on corpus statistics, three types of pitch connection constraints are proposed. Tone-based cost functions derived from the properties of each constraint play an important role in unit selection by improving continuity at the concatenation point. Applying the defined cost functions in unit selection yields more suitable units and more natural-sounding synthesized speech.
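
The idea of scoring a join by the offset pitch of the preceding syllable and the onset pitch of the following syllable can be illustrated with a minimal Python sketch. It is not the authors' method: the abstract does not define the three constraint types or the actual cost functions, so the `Unit` fields, the semitone distance, and the per-tone-pair `weights` table are all assumptions made for illustration only.

```python
import math
from dataclasses import dataclass


@dataclass
class Unit:
    """A candidate syllable unit (hypothetical structure, not from the paper)."""
    syllable: str
    tone: int          # Mandarin tone label, e.g. 1-4
    onset_hz: float    # pitch at the start of the syllable
    offset_hz: float   # pitch at the end of the syllable


def semitone_gap(f_prev: float, f_next: float) -> float:
    """Absolute pitch distance in semitones between two frequencies."""
    return abs(12.0 * math.log2(f_next / f_prev))


def pitch_connection_cost(prev: Unit, nxt: Unit, weights: dict,
                          default_weight: float = 1.0) -> float:
    """Assumed cost: tone-pair weight times the pitch gap at the join."""
    w = weights.get((prev.tone, nxt.tone), default_weight)
    return w * semitone_gap(prev.offset_hz, nxt.onset_hz)


def select_pair(prev_candidates, next_candidates, weights):
    """Pick the pair of adjacent units with the lowest connection cost."""
    pairs = ((p, n) for p in prev_candidates for n in next_candidates)
    return min(pairs, key=lambda pn: pitch_connection_cost(pn[0], pn[1], weights))


# Illustrative candidates with made-up pitch values.
prev_units = [Unit("ma", 1, 210.0, 200.0), Unit("ma", 1, 160.0, 150.0)]
next_units = [Unit("hao", 3, 155.0, 120.0), Unit("hao", 3, 250.0, 180.0)]
best_prev, best_next = select_pair(prev_units, next_units, weights={})
```

In a full unit-selection system a cost like this would be only one term, combined with target and spectral costs in a search over the whole utterance; the pairwise search here is kept deliberately small.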


Similar resources

Perceptual relevance of pitch contours of Mandarin tones and its efficacy in prosody generation of speech synthesis

Modeling Mandarin tones is one of the most important issues in speech synthesis. However, established knowledge is mainly focused on the “production” aspect. In this paper, we first characterized relative pitch levels of tones. Next, two perceptual experiments were designed to investigate “perceptual” relevance of pitch levels and shapes in Mandarin. Results showed that relative pitch levels of...


Modeling Pitch Contour of Chinese Mandarin Sentences with the PENTA Model

In continuous speech, the pitch contour of the same syllable may vary much due to its contextual information. The Parallel Encoding and Target Approximation (PENTA) model is applied here to Mandarin speech synthesis with a method to predict pitch contours for Chinese syllables with different contexts by combining the Classification And Regression Tree (CART) with the PENTA model to improve its ...


Hierarchical stress modeling and generation in Mandarin for expressive Text-to-Speech

Expressive speech synthesis has received increased attention in recent times. Stress (or pitch accent) is the perceptual prominence within words or utterances, which contributes to the expressivity of speech. This paper summarizes our contribution to Mandarin expressive speech synthesis. A novel hierarchical stress modeling and generation method for Mandarin is proposed and further integrated i...


Modelling and Decision Tree Based Prediction of Pitch Contour in IBM Mandarin Speech Synthesis System

In this paper, a method of pitch contour modelling based on the hidden Markov model (HMM) states of an acoustic unit is presented. A pair of vectors is computed from the alignment of the speech data with the acoustic unit’s HMM states. The pitch contour feature of the acoustic unit is represented by the vector pair so that the variants of the acoustic unit’s pitch contour can be measured and co...


Modeling Pitch Contour of Chinese Mandarin Sentence with PENTA Model

In continuous speech, it is believed that the pitch contour of the same syllable may vary a lot due to its different context information. To apply the Parallel Encoding and Target Approximation (PENTA) model to Mandarin speech synthesis and improve its prediction accuracy, this paper proposed a method to predict pitch contours for Chinese syllables with different contexts by combining the Class...




Publication date: 2004